# High-fidelity audio synthesis

Bigvgan 22khz 80band

BigVGAN is a universal neural vocoder, trained at large scale, that produces high-quality audio output for tasks such as speech synthesis.

Speech Synthesis

Bigvgan V2 44khz 128band 512x

BigVGAN is a universal neural vocoder, trained at large scale, that generates high-quality audio waveforms.

Audio Generation

Bigvgan V2 44khz 128band 256x

BigVGAN is a universal neural vocoder, trained at large scale, that converts mel-spectrograms to waveform audio with high quality.

Speech Synthesis

Bigvgan V2 22khz 80band Fmax8k 256x

BigVGAN is a universal neural vocoder, trained at large scale, that converts mel-spectrograms to waveforms with high quality. Version 2 speeds up inference with custom CUDA kernels and was trained on more diverse data.

Speech Synthesis

Bigvgan V2 22khz 80band 256x

BigVGAN is a general-purpose neural vocoder trained at scale, capable of generating high-quality audio waveforms from mel spectrograms.

Speech Synthesis

Bigvgan V2 24khz 100band 256x

BigVGAN is a high-performance neural vocoder that achieves high-quality audio synthesis through large-scale training, supporting multiple sampling rates and frequency band configurations.

Audio Generation
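The BigVGAN entries above differ mainly in the numbers encoded in their names. As a rough sketch, the snippet below reads those numbers out of a listing name and estimates how many waveform samples a given number of mel frames would produce. It rests on assumptions that the listing itself does not state: that `NNkhz` is the nominal sampling rate, `NNband` the number of mel bands, `FmaxNk` the upper mel frequency, and `NNNx` the upsampling ratio (waveform samples per mel frame). `VocoderConfig`, `parse_listing_name`, and `waveform_samples` are hypothetical helper names, not part of any BigVGAN API.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class VocoderConfig:
    sample_rate_hz: int            # nominal rate; the true rate may differ (e.g. 22.05 kHz)
    n_mels: int                    # number of mel bands expected as input
    upsample_ratio: Optional[int]  # assumed: waveform samples per mel frame
    fmax_hz: Optional[int]         # assumed: upper mel frequency, if encoded in the name


def parse_listing_name(name: str) -> VocoderConfig:
    """Extract the numeric fields from a listing name (hypothetical helper)."""
    rate = re.search(r"(\d+)khz", name, re.IGNORECASE)
    bands = re.search(r"(\d+)band", name, re.IGNORECASE)
    ratio = re.search(r"(\d+)x\b", name, re.IGNORECASE)
    fmax = re.search(r"fmax(\d+)k", name, re.IGNORECASE)
    if rate is None or bands is None:
        raise ValueError(f"no sampling rate or band count found in {name!r}")
    return VocoderConfig(
        sample_rate_hz=int(rate.group(1)) * 1000,
        n_mels=int(bands.group(1)),
        upsample_ratio=int(ratio.group(1)) if ratio else None,
        fmax_hz=int(fmax.group(1)) * 1000 if fmax else None,
    )


def waveform_samples(cfg: VocoderConfig, n_frames: int) -> int:
    """Estimate output length, assuming each mel frame expands to upsample_ratio samples."""
    if cfg.upsample_ratio is None:
        raise ValueError("upsampling ratio is not encoded in this name")
    return n_frames * cfg.upsample_ratio


cfg = parse_listing_name("Bigvgan V2 22khz 80band Fmax8k 256x")
n_samples = waveform_samples(cfg, 100)
print(cfg, n_samples, n_samples / cfg.sample_rate_hz)
```

Under these assumptions, a 512x variant would emit twice as many samples per mel frame as a 256x variant, so the mel-spectrogram fed to a given model needs to be computed with matching frame spacing, band count, and frequency range.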

Musicgen Stereo Melody

MusicGen is a text-to-music generation model developed by Meta AI, capable of producing high-quality stereo music samples based on text descriptions or audio prompts.

Audio Generation


© 2025 AIbase